Data Mining in the Analysis of Tree Harvester Performance Based on Automatically Collected Data

نویسندگان

چکیده

Data recorded automatically by harvesters are a promising and potentially very useful source of information for scientific analyses. Most researchers have used StanForD files this purpose, but these troublesome to obtain require some pre-processing. This study utilized new similar data: JDLink, cloud-based service, run the machine manufacturer, that stores data from sensors in real time. The vast amount such makes it hard comprehend handle efficiently. mining techniques assist finding trends patterns databases. Records two mid-sized working north-eastern Poland were analyzed using classical regression (linear logarithmic), cluster analysis (dendrograms k-means) Principal Component Analysis (PCA). Linear showed average tree size was variable having greatest effect on fuel consumption per cubic meter productivity, whereas hour also dependent, e.g., distance driven low gear or share time with high engine load. Results clustering PCA harder interpret. Dendrograms most dissimilar variables: total volume harvested day, day work revolutions minute (RPMs). K-means allowed us identify periods when specific clusters variables more prominent. results, despite explaining almost 90% variance, inconclusive between machines, and, therefore, need be scrutinized follow-up studies. Productivity values (avg. around 10 m3/h) rates (13.21 L/h, 1.335 L/m3 average) results reported other authors under comparable conditions. Some measures obtained include, (around 7 km day) proportion running low, medium load (34%, 39% 7%, respectively). assumption use without supplementing external sources, as little processing possible, which limited analytic methods unsupervised learning. Extending database studies will facilitate application supervised learning modeling prediction.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

the clustering and classification data mining techniques in insurance fraud detection:the case of iranian car insurance

با توجه به گسترش روز افزون تقلب در حوزه بیمه به خصوص در بخش بیمه اتومبیل و تبعات منفی آن برای شرکت های بیمه، به کارگیری روش های مناسب و کارآمد به منظور شناسایی و کشف تقلب در این حوزه امری ضروری است. درک الگوی موجود در داده های مربوط به مطالبات گزارش شده گذشته می تواند در کشف واقعی یا غیرواقعی بودن ادعای خسارت، مفید باشد. یکی از متداول ترین و پرکاربردترین راه های کشف الگوی داده ها استفاده از ر...

data mining rules and classification methods in insurance: the case of collision insurance

assigning premium to the insurance contract in iran mostly has based on some old rules have been authorized by government, in such a situation predicting premium by analyzing database and it’s characteristics will be definitely such a big mistake. therefore the most beneficial information one can gathered from these data is the amount of loss happens during one contract to predicting insurance ...

15 صفحه اول

a study on insurer solvency by panel data model: the case of iranian insurance market

the aim of this thesis is an approach for assessing insurer’s solvency for iranian insurance companies. we use of economic data with both time series and cross-sectional variation, thus by using the panel data model will survey the insurer solvency.

A new approach based on data envelopment analysis with double frontiers for ranking the discovered rules from data mining

Data envelopment analysis (DEA) is a relatively new data oriented approach to evaluate performance of a set of peer entities called decision-making units (DMUs) that convert multiple inputs into multiple outputs. Within a relative limited period, DEA has been converted into a strong quantitative and analytical tool to measure and evaluate performance. In an article written by Toloo et al. (2009...

متن کامل

Data sanitization in association rule mining based on impact factor

Data sanitization is a process that is used to promote the sharing of transactional databases among organizations and businesses, it alleviates concerns for individuals and organizations regarding the disclosure of sensitive patterns. It transforms the source database into a released database so that counterparts cannot discover the sensitive patterns and so data confidentiality is preserved ag...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Forests

سال: 2023

ISSN: ['1999-4907']

DOI: https://doi.org/10.3390/f14010165